Where to Look and How to Describe: Fashion Image Retrieval With an Attentional Heterogeneous Bilinear Network

نویسندگان

چکیده

Fashion products typically feature in compositions of a variety styles at different clothing parts. In order to distinguish images fashion products, we need extract both appearance (i.e., “how describe”) and localization “where look”) information, their interactions. To this end, propose biologically inspired framework for image-based product retrieval, which mimics the hypothesized two-stream visual processing system human brain. The proposed attentional heterogeneous bilinear network (AHBN) consists two branches: deep CNN branch fine-grained attributes fully convolutional landmark information. A joint channel-wise attention mechanism is further applied extracted features focus on important channels, followed by compact pooling layer model interaction streams. Our achieves satisfactory performance three retrieval benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a comparison of teachers and supervisors, with respect to teacher efficacy and reflection

supervisors play an undeniable role in training teachers, before starting their professional experience by preparing them, at the initial years of their teaching by checking their work within the proper framework, and later on during their teaching by assessing their progress. but surprisingly, exploring their attributes, professional demands, and qualifications has remained a neglected theme i...

15 صفحه اول

An Inference Network Approach to Image Retrieval

Most image retrieval systems only allow a fragment of text or an example image as a query. Most users have more complex information needs that are not easily expressed in either of these forms. This paper proposes a model based on the Inference Network framework from information retrieval that employs a powerful query language that allows structured query operators, term weighting, and the comb...

متن کامل

NIST: An Image Classification Network to Image Semantic Retrieval

This paper proposes a classification network to image semantic retrieval (NIST) framework to counter the image retrieval challenge. Our approach leverages the successful classification network GoogleNet based on Convolutional Neural Networks to obtain the semantic feature matrix which contains the serial number of classes and corresponding probabilities. Compared with traditional image retrieva...

متن کامل

How do we know where to look ? Commentary on Chukoskie

متن کامل

Where to look next and what to look for

In [Sch 96c] we have introduced the use of Multidimensional Receptive Field Histograms for Probabilistic Object Recognition. In this paper we reverse the object recognition problem by asking the question, "where should we look?", when we want to verify the presence of an object, to track an object or to actively explore a scene. This paper describes the statistical framework from which we obtai...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology

سال: 2021

ISSN: ['1051-8215', '1558-2205']

DOI: https://doi.org/10.1109/tcsvt.2020.3034981